Fuzzy Clustering for Finding Fuzzy Partitions of Many-Valued Attribute Domains in a Concept Analysis Perspective

Authors

  • Yassine Djouadi
  • Basma Alouane
  • Henri Prade
Abstract

Although an overall knowledge discovery process consists of a distinct pre-processing stage followed by a data mining step, existing formal concept analysis (FCA) and association rule mining (ARM) approaches that deal with many-valued contexts focus mainly on the data mining stage. An “intelligent” pre-processing of input contexts is often absent from these approaches, which leads to an unavoidable information loss. Usually, many-valued attribute domains first need to be fuzzily partitioned. However, it is unrealistic to expect domain experts to provide the most appropriate fuzzy partitions. This paper proposes an unsupervised learning stage, based on the Fuzzy C-Means algorithm, that yields fuzzy partitions faithful to the data for quantitative attribute domains, and thereby avoids losing valuable association rules through the use of empirical fuzzy partitions. More precisely, the paper reports an experiment showing that some rules are no longer found when such empirical partitions are used, because their support or confidence is too low. The experimental results show that the learned fuzzy partition outperforms fuzzy partitions supplied by human experts. More generally, the paper discusses the handling of many-valued attributes in both fuzzy FCA and fuzzy ARM.

Keywords: Many-valued formal contexts, fuzzy partitions, fuzzy C-means, association rules.
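To make the idea of learning a fuzzy partition from data concrete, here is a minimal sketch of the standard Fuzzy C-Means iteration on a single quantitative attribute. It is not the authors' implementation: the function names `memberships_for` and `fuzzy_c_means_1d`, the evenly spaced initial centers, the fuzzifier `m = 2`, and the sample `ages` values are all assumptions chosen for illustration.

```python
def memberships_for(x, centers, m=2.0):
    """Membership degrees of scalar x in each cluster; the degrees sum to 1."""
    dists = [abs(x - v) for v in centers]
    zeros = [j for j, d in enumerate(dists) if d == 0.0]
    if zeros:
        # x coincides with one or more centers: share full membership among them.
        return [1.0 / len(zeros) if j in zeros else 0.0
                for j in range(len(centers))]
    power = 2.0 / (m - 1.0)
    inv = [d ** -power for d in dists]
    total = sum(inv)
    return [w / total for w in inv]


def fuzzy_c_means_1d(values, c, m=2.0, max_iter=200, tol=1e-9):
    """Return (sorted centers, membership rows) for a 1-D attribute domain."""
    lo, hi = min(values), max(values)
    # Assumption: deterministic start with centers spread evenly over the domain.
    centers = [lo + (j + 0.5) * (hi - lo) / c for j in range(c)]
    for _ in range(max_iter):
        u = [memberships_for(x, centers, m) for x in values]
        new_centers = []
        for j in range(c):
            # Each center is the mean of the data weighted by membership ** m.
            w = [row[j] ** m for row in u]
            new_centers.append(sum(wi * x for wi, x in zip(w, values)) / sum(w))
        shift = max(abs(a - b) for a, b in zip(centers, new_centers))
        centers = new_centers
        if shift < tol:
            break
    centers.sort()
    return centers, [memberships_for(x, centers, m) for x in values]


# Hypothetical attribute values (e.g. ages) to be partitioned into three fuzzy sets.
ages = [18, 20, 22, 24, 40, 42, 45, 47, 70, 72, 75, 78]
centers, partition = fuzzy_c_means_1d(ages, c=3)
print([round(v, 2) for v in centers])
```

Each row of `partition` gives the degrees to which one value belongs to the three learned fuzzy sets, ordered from the lowest center to the highest. Sorting the centers is only a convenience so that the sets can be read as something like "low", "medium", and "high"; those labels are not part of the algorithm.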

Similar resources

A Framework for Optimal Attribute Evaluation and Selection in Hesitant Fuzzy Environment Based on Enhanced Ordered Weighted Entropy Approach for Medical Dataset

Background: In this paper, a generic hesitant fuzzy set (HFS) model for clustering various ECG beats according to weights of attributes is proposed. A comprehensive review of the electrocardiogram signal classification and segmentation methodologies indicates that algorithms which are able to effectively handle the nonstationarity and uncertainty of the signals should be used for ECG analysis. Ex...

Determining Fuzzy Sets for Quantitative Attributes in Data Mining Problems

The problem of mining association rules for fuzzy quantitative items was introduced and an algorithm proposed in [5]. However, the algorithm assumes that fuzzy sets are given. In this paper we propose a method to find the fuzzy sets for each quantitative attribute in a database by using clustering techniques. We present a scheme for finding the optimal partitioning of a data set during the clus...

A Fuzzy C-means Algorithm for Clustering Fuzzy Data and Its Application in Clustering Incomplete Data

The fuzzy c-means clustering algorithm is a useful tool for clustering; but it is convenient only for crisp complete data. In this article, an enhancement of the algorithm is proposed which is suitable for clustering trapezoidal fuzzy data. A linear ranking function is used to define a distance for trapezoidal fuzzy data. Then, as an application, a method based on the proposed algorithm is pres...

Assessment of distance-based multi-attribute group decision-making methods from a maintenance strategy perspective

Maintenance has been acknowledged by industrial management as a significant influencing factor of plant performance. Effective plant maintenance can be realized by developing a proper maintenance strategy. However, selecting an appropriate maintenance strategy is difficult because maintenance is a non-repetitive task such as production activity. Maintenance also does not leave a consistent trac...

Arithmetic Aggregation Operators for Interval-valued Intuitionistic Linguistic Variables and Application to Multi-attribute Group Decision Making

The intuitionistic linguistic set (ILS) is an extension of the linguistic variable. To overcome the drawback of using a single real number to represent the membership degree and non-membership degree for ILS, the concept of interval-valued intuitionistic linguistic set (IVILS) is introduced in this paper by representing the membership degree and non-membership degree of ILS with intervals. The oper...

Publication date: 2009